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On Quantum Capacity of Compound Channels 
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In this paper we address the issue of universal or robust communication over quantum channels. 
Specifically, we consider memoryless communication scenario with channel uncertainty which is an 
analog of compound channel in classical information theory. We determine the quantum capacity of 
finite compound channels and arbitrary compound channels with informed decoder. Our approach 
in the finite case is based on the observation that perfect channel knowledge at the decoder does 
not increase the capacity of finite quantum compound channels. As a consequence we obtain coding 
theorem for finite quantum averaged channels, the simplest class of channels with long-term memory. 
The extension of these results to quantum compound channels with uninformed encoder and decoder, 
and infinitely many constituents remains an open problem. 



I. INTRODUCTION 



The determination of capacities of quantum channels 
in various settings is one of the most fundamental tasks 
of quantum information theory. The classical capacity of 
memoryless quantum channels has been identified in sem- 
inal work by Holevo and Schumacher and Westmore- 
land [2j . A much sharper version of that coding theorem 
was obtained subsequently and independently by Win- 
ter Q who proved the strong converse and achievability 
with von Neumann measurements, and Ogawa and Na- 
gaoka Q who gave another proof of the strong converse to 
the coding theorem. Concerning quantum capacity, Lloyd 
Q provided a strong heuristic evidence that the quantum 
capacity of quantum memoryless channels is given by the 
regularized coherent information. The follow-up work of 
Shor [|| and Devetak Q gradually removed the residual 
ambiguity and culminated in a rigorous proof of the di- 
rect part of the coding theorem for memoryless quantum 
channels in [7[ ■ The converse coding theorem was already 
done by Barnum, Knill, and Nielsen j8j. 
The underlying assumptions on the quantum communi- 
cation model considered in that fundamental work are a) 
that the quantum channel is perfectly known, and b) that 
there are no correlations between successive channel uses, 
i.e. the channel is memoryless. Both postulates are rarely 
fulfilled for real world communication systems. E.g. it is 
practically impossible to determine all parameters the 
channel depends on with infinite precision. Thus, we are 
forced to design protocols which work well for communi- 
cation scenarios with channel uncertainty. Similarly, it is 
definitely too optimistic to presuppose that the channel 
under consideration operates in a memoryless fashion. 
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In this work we relax the first assumption while sustain- 
ing the memoryless character of the communication. Our 
model can be described as follows: We are given an arbi- 
trary set 3 of memoryless quantum channels. Both sender 
and receiver merely know that the actual channel belongs 
to this set. Consequently, they are forced to find and use 
codes that are reliable for the whole set 3 of channels. 
This is exactly the quantum analog of compound chan- 
nels from classical information theory. Moreover this is a 
channel version of the universal quantum data compres- 
sion which has been introduced by Jozsa, M., P., and R. 
Horodecki Q. 

The main results of the paper are: 

1 . The determination of the quantum capacity of com- 
pound channels in the case |J| < oo. Here we first 
prove the coding theorem for finite 3 assuming that 
the decoder knows which channel is in use (in- 
formed decoder for short). Then we show that each 
rate achievable in this situation is also achievable 
without such prior knowledge. As a consequence we 
can determine the quantum capacity of finite aver- 
aged channels. Eventually, we determine the capac- 
ity of quantum compound channel with informed 
encoder in the case \3\ < oo. Denoting the quantum 
capacities of 3 with informed encoder by Qie(3), 
with informed decoder by Qm{3), and without any 
prior information by Q(3), our results in this part 
of the paper can be roughly summarized by 

Q{3)=Qid{3) < QieO). 

It can be shown, by generalizing an example given 
by Blackwell, Breiman, and Thomasian [1J|, that 
the inequality is strict in general. 

2. We prove the coding theorem for arbitrary 3 with 
informed decoder. Unfortunately, the techniques we 
employ do not allow an extension of the result for 
finite sets of channels to arbitrary 3 without chan- 
nel knowledge at the decoder. 
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Related Work 



Notation and Conventions 



The capacity of compound channels in the classi- 
cal setting was determined by Blackwell, Breiman, and 
Thomasian [l(| and Wolfowitz [ll|, [ljj ■ It was Wolfowitz 
[ill OjJ who realized the importance of channel knowl- 
edge at the encoder and the decoder of the compound 
channel and its theoretical implications. He showed, that 
the assumption of informed decoder does not imply a 
higher capacity of the compound channel. His approach 
was to establish first the direct part for the compound 
channel without channel knowledge at the decoder and to 
show that the upper bound of the converse part with in- 
formed decoder coincides with the lower bound obtained 
in the first step. So, our proof strategy described above 
is exactly the reverse to that of Wolfowitz. 
In a beautiful paper [l3| Datta and Dorlas determined 
the classical capacity of finite averaged quantum chan- 
nels, a channel model which is equivalent to finite quan- 
tum compound channels, i.e. |J| < oo. Somewhat later 
an independent approach in the finite case was given in 
14 1 by Farkas. Subsequently, two of us [l5[ were able to 
identify the classical capacity of arbitrary quantum com- 
pound channels and, as a consequence thereof, the clas- 
sical capacity of arbitrary averaged channels. Recently 
Hayashi [l6j obtained a similar result with a completely 
different proof technique. He used the Schur-Weyl duality 
and the packing lemma from (l7j . 



B. Outline 

The paper is organized as follows: Section HT1 provides 
the necessary definitions concerning compound channels 
and codes in scenarios with and without informed de- 
coder. The following section [TTT1 contains a modified ver- 
sion of Klesse's 18] one-shot coding result which applies 
to finite arithmetic averages of quantum channels. Since 
good codes for such averaged channels are also good for 
quantum channels that are averaged over we obtain the 
basic building block of the coding theorem for finite com- 
pound channels with informed decoder which is derived 
in section HVl Moreover, using the elegant channel estima- 
tion strategy developed by Datta and Dorlas [l3| we are 
able to convert the codes for finite quantum compound 
channels with informed decoder into codes that do not 
require such prior information at the decoder. Section 
IIVI concludes with the coding theorem for finite quantum 
compound channels with informed encoder. 
Discretization arguments via r-ncts and some simple ap- 
proximation results concerning the coherent information 
and entanglement fidelity are presented in section [V] 
They are the basis for section I VII in which we carry out 
the extension of the capacity result with informed de- 
coder to the case of arbitrary set 3. 



All Hilbert spaces are assumed to have finite dimension 
and are over the field C. S(TL) is the set of states, i.e. 
positive semi-definite operators with trace 1 acting on 
the Hilbert space TL . The set of completely positive trace 
preserving (CPTP) maps between the operator spaces 
B(TL) and B(/C) is denoted by C(H,fC). C L (TL,K.) stands 
for the set of completely positive trace decreasing maps 
between B(TL) and B(JC). Throughout the paper we use 
base two logarithms which are denoted by log. The von 
Neumann entropy of a state p £ S(TL) is given by 

S(p) := -tr(plogp). 

The coherent information for AT £ C(TL, JC) and p £ S(TL) 
is defined by 

I c (p,Af) := S(Af(p)) - S((id n ® AO(MM)), 

where ip £ TL gj TL is a purification of the state p. 
A useful equivalent definition of I c {p,N) is given in 
terms of M and the complementary channel £: Due to 
Steinspring's dilation theorem J\[ can be represented as 
M{p) = tv He {vpv*) for p £ S(TL) where v : TL -> K®TL e 
is a linear isometry. The complementary channel £ £ 
C(TL, TL e ) to N is given by 

£(p):=tr n (vpv*) (p £ S(TL)). 

The coherent information can then be written as 

I c ( P ,M) = S(JV(p))-S(£(p)). 

As a last notational issue we recall the definition of en- 
tanglement fidelity. For p £ S(TL) and N £ C L (TL,K.) it 
is given by 

F e (p,Af) := (iP,(id n ®Af)(\i))(ip\)Tp), 

with a purification ip £ TL <£> TL of the state p. 



II. 



CODES AND CAPACITY 



Let 3 C C(7i,/C). The memoryless compound channel 
associated with 3 is given by the family {N® 1 : S(TL® 1 ) — > 
S(/C® i )}; e N,A/'e:j- In the rest of the paper we will write 
simply 3 for that family. 

Most of the time we will deal with compound channels 
with informed decoder, i.e. we suppose that the receiver 
knows which channel is in use. The definition of codes in 
this situation is as follows: 

An (I, ki)— code for 3 with informed decoder consists of 
a subspace Ti C TL® 1 with ki = dimJF; and a family of 
CPTP maps {K l N : S(K,® 1 ) -> 

A nonnegative number R is called an achievable rate for 
3 with informed decoder if there is a sequence of (I, ki)- 
codes such that 
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1. liminfj^oo jlogfc; > R, and 

2. lim^eo inf^ e a F e fe t , K l M o TV®') = 1 

hold with -kj= 1 := tr ^. ^ where is the orthogonal pro- 
jection onto Ti. 

The capacity Qid(3) of the memory less compound chan- 
nel 3 with informed decoder is given by 

Qid{3) ■— sup{i? £ R + : R is achievable for 3 

with informed decoder}. 

The definition of codes in the situation of an informed 
encoder is straightforward: 

We endow the encoder with a set of subspaces, one for 
each channel in 3. 

An (l,ki)— code for 3 with informed encoder thus con- 
sists of subspaces {-T^j./Vea with T^f c Ti.® 1 and fc; = 
dim^"/^ for all N £ 3 and a recovery operation 1Z 1 : 

A nonnegative number R is called an achievable rate for 
3 with informed encoder if there is a sequence of (I, ki)- 
codes such that 

1. liminfj_,oo ylogfcj > R, and 

2. lim^eo huVea F e (n^,TZ l oAf® 1 ) = 1 

hold with TTyrA- := tT t p l where pjrN i s the orthogonal 
projection onto J r / V '. 

The capacity Qie(3) of the memory less compound chan- 
nel 3 with informed encoder is given by 

Qie{3) := sup{i? £ R + : R is achievable for 3 

with informed encoder}. 

Codes and capacity for the memoryless compound chan- 
nel 3 with uninformed encoder and decoder are defined 
in a similar fashion. The only change is that we do not 
allow the recovery operation to depend on M £ 3. I.e. An 
(I, ki)— code for 3 is a pair (F^IZ 1 ) where T\ is a sub- 
space of H m with ki = dim^i and K l £ C(K.® l ,H® 1 ). 
A nonnegative number R is called an achievable rate for 
3 if there is a sequence of (I, fc/)-codes such that 

1. liminf^oo jlogfc; > R, and 

2. lim^oo huVea F e {n^ , K l o J\f® 1 ) = 1. 

The capacity Q(3) of the memoryless compound channel 
3 is given by 

Q(3) := sup{i? € R + : i? is achievable for 3}. 

It is clear from the definitions given above that 

Q(3)<mm{Q IE {3),Q ID (3) (1) 

holds for each 3 C C(H,K). 

Throughout the paper we will use the close relationship 



between the quantum compound channels with \3\ < oo 
and (finite) averaged channels. The latter are described 
as follows: Given any finite set 3 = {A/i, . . . Nn} C 

C(H,JC), real numbers Ai, . . . , Xn > with J2iLi ^< = 1, 
and I £ N we set 

N 

Af l :=^A;A/f'. 

i=l 

The averaged channel SI^a associated with 3 and A := 
(Ai,...,Aiv) is then the family {M l : S(H® 1 ) -> 
5(/C®')}; e N- The codes and capacities, Qid(Qi,\) and 
Q(21j,a), with or without informed decoder are defined 
in a similar fashion as for compound channels. 
Clearly, an (I, fc;)-code for 3 with informed decoder and 

F e ^ Fl ,n\oNf)>l~e Vi£{l,...,N} 
implies 

n 

Fefo^KRlo^^l-e. (2) 
t=l 

Conversely, ^ implies 

F e (7r^,^{oj^')>l-f V<G{1,...,JV}. 

Similar relation holds in the case of uninformed decoder. 
Thus we are led to the following simple consequence 
which will be used freely in the rest of the paper. 

Lemma II. 1 Let 3 = {A/i, ■ ■ -A/jv} C C(H, fC) and real 
numbers Ai, . . . , Aj\r > with X)i=i Ai = 1 6e given. Then 

Qid(3)=Qw(%3,x) and Q{3) = Q(Sb, A ). 



III. ONE-SHOT CODING RESULT 

Our coding result will be based on a one-shot result 
which is an extension of that for a single channel ob- 
tained in [l8j by Klesse. A similar approach for a single 
channel is given in the paper yj)j by Hayden, Horodecki, 
Yard, and Winter. 

We start with subspaces T C Q C Ti of a given Hilbcrt 
space Ti. and states wj? := p/ti(p) and irg := q/tr(q) 
where p and q are the projections onto T and Q respec- 
tively. Recall the definition of the code entanglement fi- 
delity as 

F c ^(jijr,M) := max F e (n r , K o M), 

■R.€C(K.,H) 

where we admit any completely positive trace decreasing 
(CPTD) map Af : B(H) -> B{K). The set of completely 
positive trace decreasing maps is denoted by C^(H,IC). 
Klesse's main result in can be stated as follows: 
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Theorem III.l (Klesse (T^] ) Let the Hilbert space TL 
be given and consider subspaces J- C Q C TL with 
dim.F = k. Then for any Af G C^(TL,IC) allowing a rep- 
resentation with n Kraus operators we have 



F ce {un r u*,Af)du > tr{Af{iTg)) - Vk^n\\Af{Trg)\\ 2 , 
U(<?) 

where il(G) denotes the group of unitaries acting on Q 
and du indicates that the integration is with respect to 
the Haar measure onii(Q). 

Our main goal in this section will be a variant of Theorem 
IIILll which applies to finite arithmetic averages of CPTD 
maps. 

Let Wi, . . . ,Af N G C l {TL,JC) and set 

N 



As in Theorem IIII.ll we consider subspaces J- C Q C TL 
and states 7i> and irg. 

Let ip £ TL a ® TL, TL a — TL, be a purification of njr and 
consider a Stinespring dilation of the channel Af given by 



Af( • ) =tr w .((l w ®p e M • K), 



(3) 



where TL e is a suitable finite-dimensional Hilbert space, p e 
is a projection onto a subspace of TL e , and t> : TL — » JC®TL e 
is an isometry. 

Let us define a pure state on H a ® /C ® 7i e by the formula 



$ := 



1 



We set 

p' := tr Wa8W .(|V')(^|), P ae := *k(WW\), 

and 

Pa := tr«(MM), P' e := trw.aJcW) (V'l)- 

We borrow the following lemma from [TJ|, which will 
be the basis of the result we are going to derive in this 
section. 

Lemma III.2 (Cf. QllBjij]) For any Af € C l (H,IC) 
there exists a recovery operation 1Z € C(IC,TL) with 

F e (p,TZoAf) >w - \\wp' ae - wp a ® p' e \\i, 

where w — tr(Af (n ?■)) . 

The next, simple lemma is needed in the proof of theorem 

ELI 

Lemma III. 3 Let L and D be N x N matrices with non- 
negative entries which satisfy 



Lji < Ljj, Lji < L u , 



(4) 



and 

Dji < max{Djj,D u } 
for all j,l E {!,..., N}. Then 

N N 

E ^\^7^7- -'Ex 1 ^ 

3,1=1 3=1 

Proof. Note that ([5]) implies 



(5) 



n.i < D rj + D u . 



Therewith we obtain 

N 1 V 
N 



3,1=1 



E ^VLjD~i < E ^\/ L ADn+D H ) 

3,1=1 
N 1 

< E ^\ i-j. n ,- ■ L " n ' 



(6) 

(7) 
(8) 



3,1=1 
N 



< 



E jf (\ T P J ^ + v / ^A^)(9) 



TV 

= " E \ l -iJ ,) .n- 

3=1 

where in ([7]) we have used © , in © we employed ((3]) , and 
<j9j) holds because Va + b < yfa + Vb for all non- negative 
real numbers a. b. 



□ 



Our main result in this section can be stated as follows: 

Theorem III. 4 Let the Hilbert space TL be given and 
consider subspaces T C Q d TL with dim J 7 = k. For any 
choice of J\fi, . . .JVn € C^(TL, /C) each allowing a repre- 



sentation with rij Kraus operators, j = 1, 



, N, we set 



1 N 

^ : =^E^ 



3 = 1 



Then 



F Cte (wrjru*,Af)du > tr[jV(ng)) 



U(G) 



-2Ev^ii^(^)ii 2 . 

J=l 

Proof. We can assume without loss of generality that the 
numbering of the channels is chosen in such a way that 
n% < ri2 < • • • < tin holds for the numbers of Kraus 
operators of the maps ATi, . . . ,jVn- From lemma ITU. 2 1 we 
know that there is a recovery operation 1Z such that 

F e (wr,TlojV) >w-\\wp' ae -wp a ®p' e \\ u (10) 
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where we have used the notation introduced in the para- 
graph preceeding lemma llll.2l 

For each j 6 {1 N} let {aj^}^ be the set of Kraus 

operators of TV}. Let {/i, . . . , /at} and {ei, . . . , e„ N } be 
arbitrary orthonormal bases of C N and C™ N . Let the pro- 
jection p e and the isometry v in |3]) be chosen in such a 
way that for each <f> £ TC the relation 



N rij 



.7 = 1 i=l V 



aj,i0) ® ej® fj, (11) 



holds. For a purification tp e H Q (8 H of the state 7Tjr we 
consider a Schmidt representation 



i fc 

-7= X! h ™®9™, 

* m=l 



with suitable orthonormal systems {/ii, . . . , and 
{<7i, . . . , A calculation identical to that performed by 
Klesse [18| shows that the states on the right hand side 
of (fT0|) can be expressed with the help of representation 
(fTTj) as 



fc Z/ 

j,£=l z,r=l s,i=l 



w Pae 



N 



with x s ,ij := h s ® a® fj, and 



|*^s,«,j) {^t.r.l | ) 

(12) 



" tr(aj j7r^-a| * r ) 

WPa®P e = 2^ 2^ fcjv ' Pa® \Vi,j){yr,ll (13) 



j,l—l i,r—l 



where t/jj := ei® fj. 

If we perform the unitary conjugation induced by the uni- 
tary map x s ^ij = h s eg) ei <£> /j i— > a^jj = g s ® ei ® fj fol- 
lowed by the complex conjugation of the matrix elements 
with respect to the matrix units {\x' g ij)(x' t k i\} s ,i,j,t,k,l 
we obtain an anti-linear isometry / with respect to the 
metrics induced by the trace distances on the opera- 
tor spaces under consideration. A calculation identical 
to that in [l8[ shows that under this isometry the sub- 
normalized states in (fT2"]) and (fTU|) transform to 

AT rij ,m 

Hwp'ae) = T7f Z Z P a l^l,rP®\yi,3){yr,ll (14) 



3,1=1 i,r—l 



and 



^ N rtj,n 

i{ wPa ® P ' e ) = - ^2 



- teipaj^rp) 

p®\yi,j)(yr,i\, 



j,l=l i,r=l 



kN 



with p = knjr and y 



ei ® /j for j = 1, 
z = l,...,7ij. In summary, using the isometry /, (|14p . 
and (|15| the inequality (flQ|) can be formulated as 



F e (Trr,KojV)>w-\\D(p)\\ u 



(16) 



with w = tr(A/"(7r^)) and 



j,/=l z,r=l 



where 



D (iMrl)(p) : = £ fw.W.rP - ^ tr (P a j,i a l,rP)PJ ■ 



Let us define 



rij,ni 



DM-.= J2 D m(kl){p)®h)(ek\®\fj)(fi\- (17) 



-l,k=l 



The triangle inequality for the trace norm yields 



||0(p)||l < £ ^H^WIIl 

r 1 / 

W 1 / 

3,1=1 

where the second line is justified by the standard relation 
between the trace and Hilbert-Schmidt norm, ||a||i < 
Vrf||a||2, d being the number of non-zero singular values 
of a. 

In the next step we will compute ||-Dj,;(p)|||. A glance at 
(Tr7|) shows that 



(19) 



i=i, fc=i 



and consequently we obtain 

IPi,j(p)lla = tr((^- i (p))*^-, i (p)) 

= E ^((^(^(fcofp))*- ^)^)^)) 



i=l,r=l 

rij ,ni 



-2 £ {tr(p(a^a iir )*pa^ai >r ) 



A- 2 



i=l,r=l 
T l tr (P a i,i«i,r-)| 2 }- 



(20) 



Let U be a random variable taking values in il(Q) accord- 
ing to the Haar measure of il(^). Then we can infer from 
HH1> that 



(15) N i / 

N and E(\\D(UpU*)\\x) < ^ V ^ E (ll^,KW*)lll 



(21) 

where we have used the concavity of the function y/~ 
and Jensen's inequality, and moreover, we abbreviated 
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k min{ny, m \ by Lji . Now, starting with (|20[) and arguing 
as Klesse [18} we obtain that 

mDj.Aupunwi) < trMfaWfa)) 

= (Nj(Trg),M(wg))HS, (22) 

where ( ■ , ■ )hs denotes the Hilbert-Schmidt inner prod- 
uct. Similarly 



E(tr(Af(UTTjrU*))) = tr(AA(7r e )). 



(23) 



Now, using (10]), dTHj), d20J) , d2Tj) , |22]), and ([23]) we arrive 
at 



w i 

E^V 71 ^ (24) 



where for j,l € {1, . . . , iV} we introduced the abbrevia- 
tions 



Lj7 = fcmin{nj, n;}, 



and 



It is obvious that 

Lji < Ljj and Lji < L u 

hold. Moreover, the Cauchy-Schwarz inequality for the 
Hilbert-Schmidt inner product justifies the following 
chain of inequalities 

Dji = Wito)Mii*g))H8 

< mto)MWi(*o)\\2 

< max{||^(7r e )||I,||M(7rg)||l} 
= max{Djj,Du}. 



Therefore, an application of Lemma IIII.3I allows us to 
conclude from (l24l) that 



R(F Bte {U*rU*,M)) > tr(AA(7r g )) 



N 



-2^^11^(^)112, 



which is what we aimed to prove. 



□ 



IV. DIRECT PART OF THE CODING 
THEOREM: FINITE CASE 

In this section we will prove the coding theorem for fi- 
nite compound channels. This is done in subsection IIV CI 
after recalling some well-known properties of frequency- 
typical subspaces and Kraus operators in subsections 
llt^ and lTTBl we 



A. Typical Projections 

In this subsection we will collect some well-known re- 
sults on typical projections. 

Let p £ S(Tl) be a state and consider any diagonalization 

d 

P = ^h\ei)(e-i\, 

i=l 

where d := dim7i. Using this representation the state 
p® 1 can be written as 

p ®i = \ x i\e x i)(e x i\, 

with A = {l,...,d}, x l := (x\,...,xi) £ A 1 , X x i :— 

X Xl ■ ... ■ X Xn and e x i := e Xl ® . . . ® e Xl . 

The frequency-typical set of eigenvalues of p is given by 

T s ,i := {x l £ A 1 : \\p x , - X\\i < 8, p x , « A}, 

where p x i denotes the empirical probability distribution 
on A generated by x , i.e. 

\{je{l,...,l}:xj=x}\ 



Px' (x) 



I 



X is the probability distribution on A defined by the 
eigenvalues of p, and p x i <C A means that p x i (x) = 
whenever X x = 0. 

The frequency-typical projection qgj of p given by 
Qs,l ■= E l^ 1 )^'! 

has the following well-known properties: 

Lemma IV. 1 There is a real number c > and a func- 
tion h : N — > R+ with h > and lim/—^ h(l) = 
such that for each 5 £ (0,1/2) there exists a number 
ip(5) > 0, with lima-fO <p(8) = 0, and for any p £ S(H) 
the frequency-typical projection qs.i £ B(TL)® 1 satisfies 

1. tr(p® l q Sil ) > \-2- li - c&2 - h ^\ 

2. 2- , W'>+*W)( toiI < q S ,iP® l q S ,i < 2"™-^))^, 
and 

3. m (5)2 l(s ^-f( s )) < tr{q 5t i) < 2 1{ - S ^+^ where 

m {5) : =l- 2 -^ 2 -M0). 

The rightmost inequalities in 2. and 3. imply 

\\q 5 ,lP^q 5 M<^ mp) -" V{5)) - 
Moreover, ip(8) and h are given by 

h{l) = j log(Z + 1) VI £ N 



<p(S) 



-Sloe 



The proof of the lemma is fairly standard and rests on 
purely classical reasoning. It combines the Bernstein- 
Sanov trick fcf. | 20 | . sect. III. 1) and the type counting 
methods from [17j . 
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B. Typical Kraus Operators 

According to Kraus' representation theorem we can 
find to any Af G C(H.,fC) a family of operators 
Oi, . . . , a n e B(H, K.) with J27=i a * a i = aim 

i=l 

for all peS{H). 

We fix the maximally mixed state 7rc_supported by the 
subspace Q oiTC. It is easily seen (cf. [2l|) that the Kraus 
operators a\ , . . . , a n of AT can always be chosen such that 

tr(a l 7r g a*) = 5ytr(a*7rga?), 

for alH, j € {1, . . . , n}. With this choice of Kraus opera- 
tors we can define a probability distribution r on the set 
£:={l,...,n}by 

r(i) := tr(aiTTga*), (i € B). 

It is shown in [22j that the Shannon entropy of r is noth- 
ing else than the entropy exchange S e (Trg,Af), i.e. 

H(r) = S e (%g,JS). 

In a similar vein as in the previous subsection we intro- 
duce the notion of frequency-typical subset for r, i.e we 
set 

K s ,i := W G B l : \\p v i -r\\x< S, p y , < r} 

with S > 0. With this we can introduce the notion of the 
reduced operation by setting 

Ns,l{p) '■= £ a v l P a *y^ ( 25 ) 

where a y i := a yi ® • ■ • <8> a yi and p G S(H® 1 ). Moreover, 
we set 

ns,i := \K S j\, 

which is the number of Kraus operators of the reduced 
operation A/a,;. The properties of frequency- typical sets 
(cf. [UHS])) lead immediately to 

Lemma IV. 2 Let S G (0, 1/2), I G N, and fix the maxi- 
mally mixed state ng on the subspace QofH. There is a 
real number ^(6) > with lim^o "f(S) = such that for 
each Af G C(H.,1C) the reduced operation Afs.i satisfies 

1. tr{Af8,i{irf)) = r & {K 5l i) > 1 - 1~ l{c ' 5 '' ''~ h with 
a universal positive constant c' > 0, 

2. n s ,i < 2'( s e(-s^)+7(5)). 

The function hf : N— > M+ is given by h'(l) = ^ log(i+l). 



C. The Direct Coding Theorem for Finitely Many 
Channels 

Let us consider a compound channel given by a finite 
set 3 :— {A/"i, . . . , A/at} C C(H.,JC) and two subspaces 
£i,G® 1 of Tt® 1 with £i c g m G H m . Let h := dim£/ 
and consider the associated maximally mixed states 7T£ Z 
on Ei and ~ng on 

For j G {1, . . . , A} and states A/}(7rg) let q jA i G £(/C) 0i 
denote the frequency- typical projection of Afj(~Kg) defined 
in Sec. II V Al for 5 > and I G N. Moreover, let Mj,s,i be 
the reduced operation associated with Afj(irg) for j G 
{1, . . . , A}, (5 > 0, and !eNas considered in Sec. IIVBI 
To every projection q on H® 1 , associate the operation 
Q G &(H® l ,H® 1 ) defined by Q( ■ ) := q ■ q and let q- 1 
denote the ortho-complement of q. 

In the following, for the compound channel 3 and every 
I G N, we will consider the average code fidelity 

1 N 

FceAm, , 3) := max - V F e (ir £l , K K o Af® 1 ) 

2 — 1 

where it is understood that TZj^ i are recovery operations. 
Note that for every e G (0, 1), F c _ e: d(TT£ l ,3) > 1 — e implies 
the existence of an (I, k{) code with informed decoder for 
3 that satisfies F e (n£ n Hj^. o Afi) > 1 — A • e for every 
" {1 V], 

For given £ > 0, j G {1, A} and / G N we define 

A/j( • ) := Q jA! o^ Ai ( • ) 
and, accordingly, 

1 N 

i=i 

We show that, for every j G {1, A}, 

F c , e (n £l ,Aff )>F C) e(7r £l ,A/})- (26) 
Let 7?. G C(JC® 1 ,7i® 1 ) be any recovery operation. Then 
^,e(^,A/f ) > F e (7r £i ,^o(Q jAi + Q^)oAA^) 

> F e {n £l ,noMj). 

The last inequality follows from the fact that Nj is a 
reduction of Qj,&,i . Taking the maximum over all 
recovery operations 1Z proves the claim. 
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Using (|26p. we get the lower bound 



1=1 

1 N 



N ■ 

i=l 

> F Cte (ir £l ,Af) 



(27) 



Note that the inequality (|27|) is still valid if we replace 
ir £l by uir £l u* for any u G 11(5®'). Consider 

F e (a,3f) :=E(F C) e,rf(C/7r £l C7*,30) 

with a random variable U taking values in ii(Q® 1 ) and 
which is distributed according to the Haar measure. From 
(g7J) it follows that 



F e (0,O) >E(F c , e (Uw £l U*,ti)). 



(28) 



Applying Theorem [TTL4] to the right hand side of (f2"5|) we 

arrive at 

F e (a,3f) > E(F c , e (U7T £l U*,Sf)) 
N 

-2Y,Vk^i\\M^f)h, (29) 
i=i 

where rij,^ stands for the number of Kraus operators of 
the reduced operation J\j . 

Theorem IV. 3 (Direct Part: Informed Decoder) 

Let 3 — {Afx, ...,A/jv} C C(H,K.) be a compound channel 
and Ttg the maximally mixed state associated to a 
subspace Q C H. Then 

Qid{3) > mm^I c {Tig,Ni). 

Remark. Arguing as Klesse [l8j|, an immediate conse- 
quence of this theorem is the inequality 



Qid(3) > lim - max min IJp,M^ 1 ). 



See also the argument in section fVIl Using the quantum 
Fano inequality and following the lines of [g], it is easy 
to establish the converse 

QidO)< lim- max min I c (p,N^ 1 ). (30) 

Proof. We show that for every e > the number 
min^ea I c (Trg,J\f) — e is an achievable rate for 3. 

1) If min^ea Ic(^g, A/i) — e < 0, there is nothing to prove. 

2) Let min^ea I c (ng,Afi) — e > 0. 

Choose (5 S (0, 1/2) satisfying j(6)+3<p(5) < e with func- 
tions if), <p from Lemma TlV. 1[ IIV."2l 



Pick a sequence of subspaces ("K £l )i£^ with ir £l < 
■Kg 1 for all I G N that are of dimension ki = 
^/(min^^/c^s^O-^j^ By s ^ > i^g^tf.) (see [3), 
this is always possible. 

Obviously, j logfc; < mm^ i63 I c {itg,Ni) - £■ 
We consider the terms in (|28p separately. 

1 W 

tr(A%f<)) = ? ^[tr(Q jAi o^' (Trf)) 



1 - 

^ ^E^A^f (<)) 



+tr((A/},,,-A/f )(<)] 



-tr( E aj,»"rf 
> 1 — 2~ l ( cS2 ~ h ^ — 2~ i ( c '< 52 ~' l 'W) 



(31) 



We used the fact that is a reduction of AA®' and 

Lemmas ITVjJIO 

Further, using the inequality \\A + B\\l > \\A\\% + |LB||| 
valid for nonnegative operators A,B G B(K.® 1 ) (see [13]), 
we get the inequality 



H\\ |2 



l t ; /1I2 



= ll?^A/f (ttJ 

< trfe- dV )2- 2 '( s « ! ^ ! ))-^(*)) 

< 2 -;(S(Af' ( 7 rf i ))-3 V (5))_ ( 32 ) 



The last two inequalities follow from Lemma TlV. II Com- 
bining the inequalities (|2"5|) . (f3"Tj) and (13"2"1) with Lemma 
IIV.21 we finally see that 

F e (g, 3) > 1 - 2" z(c,j2 "' l( ' )) - 2- i ( c ' ,52 - h 'W) 

AT 



-2V v / 2'(Ti°sfc. 



l + 1 (5)+3 v >(S)-I c (7vg,Af j ) 



This shows the existence of at least one sequence of (I, hi) 
codes for 3 with informed decoder and 

lim — logfc; = min I c {irg,Ni) - e 

l— »oo I Ni&3 
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as well as, for every I € N, 

' -F^f^KMoM® 1 ) >l-N-ei 



mm . 

TVs a 



with 



□ 

Our next goal is to convert the codes for the given, finite 
set 3 C C(H,IC) with the informed decoder into truly 
compound codes. To this end, we need the channel es- 
timation technique of Datta and Dorlas [l3| which we 
recall now: 

Theorem IV.4 (Datta & Dorlas fl3l ]) 

Let 3 = {Afi,...Af N } C C{H,K) with JV 6 N. Set 
L = {%). There is f = f(3) € (0,1) such that for 
each m £ N we can find mutually orthogonal projections 

Pl im L,...PN,mL £ B(JC® mL ) With J2ZlPi,mL = 1^ 

and a pure state a/ mi ) € S(7i® mL ) such that 

tr( Pl , mL A/? mi (a/ mi ))) > (1 - NTf' 1 
holds for all i G {1, . . . , -/V}. 



With the help of theorem IIV.4I we will show that knowl- 
edge of the channel in the finite compound scenario does 
not increase the capacity. 

Lemma IV. 5 For any finite set 3 = {Afi, . . .Wat} C 
C(H, IC) we have 



!ID 



(3) = Q(3). 



Proof. The inequality Qid(3) > QP) is obvious from the 
definitions of these capacities. 

The proof will be complete if we show that each rate 
R achievable within the scenario of an informed decoder 
is also achievable without knowledge of the channel. Let 
{Ft, {R-WiLi) be a (*> h)-code for 3 = {Mi, . . . ,M N } with 
informed decoder and 



F e (-Kr t ,nloAf®*)>l-s 



(33) 



for some e £ (0, 1) and alii € {1, . . . , N}. 

Let the pure state uy- mIj ' and projections 

Pi,mL, ■ ■ ■ ,PN,mL be as described in Theorem IIV.4I 

Take x mL € H® mL such that w^ mV > = \x mL )(x mL \ 

and define a set of measurement operations R" lL S 

C l(lC® mL ,H® mL ) by Rf L { ■ ) := ^ mL Hr{ PumL ■ p i>mL ). 

Set 



K mL+t :=^i?r L ®^, 



T' 



L+t 



and consider the averaged channel 



i N 

KfmL+t _ J_ >r®{mL+t) 

JV ^ 1 

i=l 



With the abbreviation 

F e := F e (n^ L+t ,n mL+t o M mL+t ) 
we arrive at 
1 N 



i=l 
N 



AT 



^ 4 E ^(^ ( " lL) . ° M® mL )F e (tt^ , 72* o A/f*) 
1 W 

> ¥ 5](x mL , W (™ i Hr(p J , mL M( W ( mi ))) a;mL )(l - e) 



> (i_ Ar/ - )J v-i (1 _ £)) 



(34) 



by theorem II V . 41 and (|3"3")l . We see from (|3"9")l that for each 

ie {1,...,JV} 

F e (7r^ ^ft^oA/f (roi+t) ) > l-JV(l-a(JV,m, £ ,/)) 

(35) 

with a(N, m, e, f) = (1- N / m ) iV - 1 (l-e). Replacing, for 
sufficiently large ( € N, m by [v^J , i by ^ — L[v^J , and 
£ by a sequence (extern with lim^oo e t — we obtain a 
sequence of (I, fc;)-codes (J-'^TZ 1 ) for 3 with 



lim 

2— >oo iG 



min Fei-Kf^ll 1 oNf) = 1 

{!,..., AT} I' 4 



and 



lim inf — log dim T[ = lim inf - log k t , 



which concludes the proof. 



□ 



Lemma TlV. Ill Theorem IIV.31 and (f3T)| immediately im- 
ply the following coding theorem. 

Theorem IV. 6 (Coding Theorem: \3\ < oo) For 

any finite set 3 — {Mi, . . -Mn} C C(7Y,/C) we have 

Q{3) = lim - max min I c (p,M® 1 ). 
i^oo I pes(n» l ) ie{i,...,N} 



Utilizing lemma Hi. II we obtain immediately the following 
capacity result for finite averaged quantum channels. 
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Corollary IV. 7 (Capacity of Averaged Channels) 

For any finite set 3 — {A/i, . . . A/;v} C C(H, JC) and any 
A = (Ai, . . . Ajv) G R N with Ai, . . . , X N > 0, A 4 = 1 
we have 

Q(a 3 , A ) = lim t max min I c (p,A/?')> 

where 2la,A denotes the averaged channel defined by 3 and 
A. 

We shall now briefly discuss the case of an informed en- 
coder for finite compound channels. 

Lemma IV. 8 Let ttc be the maximally mixed state on 
a subspace C C 7i® of dimension D and £ : B(H® 1 ) — > 
B(H® 1 ) such that 

F e {TT C ,£) > 1-6. 

To every K < D there exists a subcode C <Z C of dimen- 
sion dim C" = K such that 



F e (ir c >,£) > 1 



D 



Proof. Let P be the orthogonal projection onto C, that 
is 7rc = tr /p\ P. Set L := Lj^J- Take a decomposition 



i=l 

of C into pairwise orthogonal subspaces Cj C C that 
satisfies dimC^ — K, i E {1, ...,L} and dimC^+i = D — 
KL. 

To every Cj, let the maximally mixed state on Cj be 
denoted by 7TC;- Clearly, 



t=^K D-KL 
n c = 2^ + D 



Lemma IV.9 Let n £ N, A, B E R and A > B > 0. 

Then 



l2 nA \ 



[2" B \ ■ L&J 



< 1 - 3 • 2 



-nB 



Proof. Let, for the moment, n be fixed. There are 
S A ,S b ,Sab G (0,1) such that L2 nA J = 2 n ^ + £ A , [2 nB \ = 
2 nB + 8 B and Lj^jj = f^rJt + <W We will now de- 
rive a lower bound on the denominator. 



(2 nB + 6 B )( 



2 nA + 5 A 



Sab) > 2 



nB 



-tnA 



2 nB + 5 B 



2 n(A+S)_ 



> 2 ni - A+B ^-^- (37) 



■)nB 



With the help of ([37]) we get 

\2 nA \ _ 2 nA + 5 A 



\2™ B \ • LP|J (2" b + 6 B ){%£$£ + 8 AB ) 

< (2 nA + 5 A )2- n{A+B) (2 nB + 1) 

< (2 nA + \)2~ ni - A+B) {2 nB + 1) 
= (2"™ B + 2"" (/l+B) )(2 rli3 + l) 

2~n{A+B) 



1 + 2 



—nB i o— 



< 1 + 3-2 



□ 



Theorem IV. 10 (Converse: Informed Encoder) 

Let 3 = {7Vi, . . . ,Nn} C C(H,JC) be a finite compound 
channel. The quantum capacity with informed encoder 
of 3 is bounded from above by 

Qie{3) < min Q(A/}). 



By assumption and use of convexity of entanglement fi- 
delity in the input distribution, this implies 

1-e < F e (n c ,£) 

^ K , „. D-K L , 
^ T,n F ^,£) + p; F e (n CL+1 ,£) 



D 



K K 
< L— max FJir c ,£ )) + 1 - L — . 

D Ki<L K K ^'11 J) 



(36) 



Clearly, (|36|) implies that there exists j E {1,...,L-^-} 
such that 



1 - 



D 



L ■ K 



e<F e (ir Cj ,£)- 



Set C = C,. 



□ 



This is easily seen using standard techniques from [8j]. 

We will now show that our upper bound is indeed 
achievable. To this end, we will now convert codes for 
single channels into codes for 3 with informed encoder. 
Again, we need the channel estimation technique of 
Datta and Dorlas [l3| which has already been stated in 
Theorem [TO] 

Theorem IV. 11 (Direct Part: Informed Encoder) 

For any finite set 3 — {A/"i, ■ ■ .-A/at} C C(7i, JC) we have 

Qie{3) > mm Q(M). 

Proof. Throughout the proof, we will write 'for all V 
meaning 'for all i E {1, A}'. 

Let e > be arbitrary and {ng 1 , . . . Tg N } be maximally 
mixed states on corresponding subspaces Gi C H. By the 
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noisy channel coding theorem [la| . for every i there are 
,K\) lm , (e| l) ) ;eN with e ; W 



sequences (J^jTZ 1 -)^, (e^)i^ with e[ 4 ' \ such that 



dim .77 = y^h^^M)-^)^ 

where ei := maxi<i<Ar e\ and, clearly, ei \ 0. 
We set E := {i\I c {^g z -,.N'i) = mini<,<jv Ic(~kq 3 ,W})} and 
denote the complement of E within {1, N} by E^. We 
take k := min{i|i € E}. 

By definition, for every j £ E 1 ^ we have I c (Trg k ,ATk) — e < 
Id-Kg^Nj) ~ e. By lemmas ITOI and HV^l for j G £ C 
there exist subspaces .F/ with dim-Ff = [_2 ( - rc(7rs *'' A/ * ) ~ £ J 
such that 

Setting, for every i £ E, T\ :— T\ and using the abbrevi- 
ation Si := ei(l+^-2- l ^o^k)-^)) -observe that Si \ 0- 
we get 

F^,n\oMf)>l-8 u (38) 



lim - log dim T\ = I c (-Kg k ,N k )- 

l — >oo t 

Let m,t £ N be arbitrary and let the pure state oj( mL > 
and projections Pi, m L, ■ ■ ■ ,PN.mL be as described in the- 
orem ITOl Take x mL £ H® mL such that cj( mL ) = 
\xmL)(x m L\ and define a set of measurement operations 
R\ nL £ &(K® mL ,H® mL ) by Rf L ( ■ ) := J mL hr{p^ mL ■ 

Pi,mL)- 

Set 



n 



N 



•F riiL+t '■— ' ^mi} <8> Fj? ■ 

With the abbreviation 

F l e := F e (vr^ !i+t ,^ mL+t o AA® (mL+t) ) 

we arrive at 

F* = F e (mL) ' ° K®" 11 ) ^ ,T^°K Pt ) 

> F e (J mL \Rl nL o M® mL )F e {n fl ,n\ o A/?*) 

> (a; m L^ (mi) tr( K , mL M( W ( mL ))) a ;„ iL )(l - <5 t ) 

> (l-TV/" 1 )^!-^), (39) 



Replacing, for sufficiently large I £ N, m by [•s/Ij , i by 
, we obtain a sequence of (I, fc;)-codes for 3 with 
informed encoder that achieves 



min J c (7rg., Afj) - e. 

l<j<N 3 J 

Since this holds true for every e > we know that for ev- 
ery set {wg 1 , . . . , ttq n } the number mini<j<jv Ici^g, >Nj) 
is an achievable rate. 

Let {<r\ m) }i<i< N be such that 7 c (^ m) ,M® m ) = 
max p(m) 7 c (p( m ),A^® m ) for all i and let (4™| 4 Wn be 
the sequence of frequency typical states associated to 
(crj™ ' )neN' We get the following chain of inequalities. 

Q IE (3) ^ - lim -Q IE (3^ nm ) 
m n— >oo n 

> -lim lim - min IJn^J,, (jVf ro )®") 
( 3 ) • 1 ,. ,. 1 , ,_(m) 



min -lim lim -J c «7,-, (A/? "T") 



l<j<jv m 

(4) . I 

i<j<N m 

(5) . 1 

= mm — max 

Afj &3 m p( m ) 



T 1 (m) <r(g)r, 



I c {p^ m \Mf m ). 



Here, (1) is easily seen using using the fact that pure 
states can be transmitted with entanglement fidelity one 
over any channel by using a constant recovery operation. 
(2) has just been proven, while (3) is just continuity of the 
function min. In (4), we use the variant of BSST lemma 
IVI.2l for single channel. The last line (5) follows by choice 
of c| m ' . Taking the limit m — > 00 proves the claim. 

□ 



V. NETS AND DISCRETE APPROXIMATION 
IN THE SET OF QUANTUM CHANNELS 

The purpose of this section is to collect and provide 
necessary building blocks for discretization arguments to 
follow in section I VII We start with the existence of good 
discrete approximation in the set C(TL,K.) with respect 
to the diamond norm || • ||<> in subsection IV Al The rea- 
sons for using this norm are that a) it is multiplicative 
with respect to tensor products of channels, and b) that 
it is easy to relate to the entanglement fidelity and co- 
herent information. These relations are made precise in 
subsection IV Bl 



A. Existence of Good Nets 

Let C(H,K) denote the set of CPTP maps with do- 
main B(Tt) and range B(JC). Without loss of generality 
we assume that 7i = C d and JC = C d . The constructions 
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to follow will be based on the diamond norm on the set 
of channels. It is defined by the prescription 



sup max \\(id n ® .AT) (a) IN, (40) 

„ e NaGB(C"®-H),||a||i = l 



where 1 1 • 1 1 1 stands for the trace norm, id n 
B(C n ) is the identity channel, and TV : B{H) B{K) is 
any linear map, not necessarily completely positive. 1 1 • 1 1<> 
is the dual norm to 1 1 • | | c &, the norm of complete bounded- 
ness. We refer to [23[ and for additional information 
on these norms. It is an important fact that we need not 
maximize over all non- negative integers in (|40p , the norm 
stabilizes at the value n = d = dim7i, i.e. 



max \\(id d ®Af)(a)\\i. (41) 

aeB(C d ®-H),|!a|| 1 = l 



Two different proofs of this fact can be found in [23j and 

It is well known, and not difficult to show, that for M E 
C(Tt,lC) the relation 



1 



holds. Consequently, C(H,K.) C {$ : B{H) -> B{K) : 
$ is linear and 1 1 ^E* 1 1 o = 1} =: Sq . Moreover, it is clear 
that C(H,IC) is a compact convex set. 
A r-net, r > 0, in C(H,JC) is a set C C(H,JC) 

such that for each M € C(H,IC) there is at least one 

*g{i,." ,n} with |pv-jy/i||o < t. 

The following basic lemma shows the existence of t— nets 
in C(7i,K.) the cardinality of which grows polynomially 
in 1/t. 

Lemma V.l For any t E (0, 1] there is a r—net {Mi}^L 1 
in C(H,K.) with N < (|) 2(d ' d ' )2 ; where d = dimH and 
d! = dim/C. 

Proof. The lemma is p roved by simply imitating the proof 
of Lemma 2.6 in [25| where the corresponding result is 
shown for spheres in arbitrary finite dimensional normed 
spaces. We give the full argument for convenience. 
Let {M.i}fL l be an arbitrary subset of C(7i,/C) with the 
property that 

for all i 7^ j, i, j E {1, . . . , M}. We will establish an upper 
bound on the integer M now. 

The open balls B§(A4i, |), i = 1, . . . , M, with centers at 
Mi and radii r/2 are mutually disjoint and are contained 
in the ball B (0, 1 + §) since C{U,K) C S . So, 



M 

U^0(M i ,J)cB <> (Q > l + 5) 



(42) 



Let be the Borel-Lebesgue measure (or equivalently 
the Haar measure) on (B(B(H), B(K)), ^Borel) where 
B{B{TL),B{fC)) denotes the set of linear maps from B(H) 



to B(JC) and Y^Borei is the a— algebra of Borel sets. Com- 
puting the volume of the sets in (|4"2"1) we obtain 

M-(-J ^(0,1)) < (l + -j /x(So(0,l)), 

where B$(0, 1) is the open unit ball with respect to the 
0-norm and 2(d-d') 2 is the dimension of B{B{H),B(K)) 
as a vector space over the field R. This last inequality is 
equivalent to 

M< (1 + -J ■ (43) 

Now, let {A/ij^j be a maximal set in C(TC,IC) with the 
property that | \Ni—Nj \ |<> > r for all i ^ j. Then, clearly, 
{■/Vi}£Li ^ s a r— ne t and holds. Due to our assump- 
tion that r E (0, 1] we obtain 



V - | 1 + - 



2(d-d') 2 



I 3 - 



2{d-d') 2 



and we arc done. 



□ 



B. Discrete Approximation of Compound 
Quantum Channels 

Let 3 C C(Tt,K.) be an arbitrary set. Starting from a 
r/2-net m := {/VJ^ with N < {±f^') 2 as in Lemma 
IV.ll we can build a r/2— net J' T that is adapted to the set 
3 given by 

3' T := {Mi E m : 3Af E 3 with \\JV - JV t \\ < r/2}. 

(44) 

Let U E C(H,IC) be the useless channel given by U{p) :— 
-jrl/C: P £ S(H), and consider 

3 T :={{1- T -)N+ T -U:AT E3' T ), (45) 

where 3' T is defined in (|44l) . For 3 C C(r{, /C) we introduce 
the abbreviation 

I c {p,3) := m^I c (p,M), 

for p E S(TL). We list a few more or less obvious results 
in the following lemma that will be needed in the sequel. 



Lemma V.2 Let 3 C C(Tt,IC). For each positive t < - 
let 3 T be the finite set of channels defined in 

1. \3 T \ < (f )2(^') 2 

2. For JV E3 there is Mi E 3 T with 

\M® l -K® L \\o < lT - (46) 
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Consequently, for M ', Mi, and any CPTP map 1Z : 
B(JQ® 1 -» B{T) the relation 

\F e {p,noM® l )-F e { P ,noMf)\<iT (47) 

holds for all p £ S{H® 1 ) and I £ N. 
3. For all p £ S{TL) we have 

\I c (p,3) - I c (p,3 T )\ < t + 2r\og-. (48) 

T 

Proof. 1. This is clear from definition of 3 T . 

2. It is clear from construction of 3 T that there is at least 
one Mi £ 3 T with 

\\M - Mi^ < T. 

We know from [H,|24| that ||^i®^ 2 || = l^iHo'll^Ho 
holds, i.e. 0— norm is multiplicative. The inequality (|46]) 
is easily seen using repeatedly the tensor identity 

ai <g) bi - a 2 ® b 2 = a 1 g) (&i - 6 2 ) + (ai - 02) <8 &2, 

the multiplicativity of the 0— norm, and the fact that 
||ft||o = 1 for all CPTP maps. 

Let V € 7i m ® 7i m be a purification of p £ S(H® 1 ). 
Let us denote the left hand side of (|47|) by AF e . By the 
definition of the entanglement fidelity we have 

AF e = IfoMdf ® (fto (JV® 1 -JO(|$MM|. 

An application of the Cauchy-Schwarz inequality shows 
that 

Ai^ e < \\idf ®(1lo(M® l -M? l ))(\ij)(<lPM\ 

< \\idf^(iio(M^-MP i ))(mm\oo 

< \\idf®(Ko(M® i -M® i ))m(mi 

= \\(ldf ® ft) O ® (A^ ~ A/f )))(|^|)||l 

< \\n\\o\w® l -x®%\mm\i 

< It, 

where we have used ||7£||<> = 1, WV'XV'llli = L an d (j46l) . 

3. The proof of ([4*5]) is based on Fannes inequality [26| 
and uses merely standard conclusions. So, we will con- 
fine ourselves to a brief outline of the argument. Fannes 
inequality states that \S{a\) — S(a2)\ < rlogd — rlogr 
for all density operators with \\a% — 02II1 < t < 1/e. To 
the given r we can always find an M' £ 3 with 

I c (p,M')<I c (p,3)+T. (49) 

On the other hand there is Mi £ 3 T with 1 1 M' - Mi 1 1 < r. 
This implies immediately 

IPV'(p)-M(p)||i<r, 

and 

||irf d «> jv'd^xvi) — irfd«> ATidvX^DIIi <t 



by the definition of 0— norm where ip £ ri ® Ti is a pu- 
rification of yO £ S(H). Since 

/ c (p,A/"') = S(M'( P )) - S(id d ®M'(\i>)(ip\)) 

with a similar relation for Mi, an application of Fannes 
inequality leads to 

h{p,Mi) < Ic(p,M') + 2(rlogd - rlogr). 

This and J49]) show that 

Ic{p^r) < I c (p,3) + r + 2(rlogd- rlogr). 

The inequality 

40,3) < / c (p,3 T ) + 2(t log d- rlogr) 

is shown in a similar vein. 

□ 

VI. DIRECT PART OF THE CODING 
THEOREM: GENERAL CASE WITH INFORMED 
DECODER 

In this final section we will utilize our coding results for 
finite 3 and the discretization techniques from the previ- 
ous section to establish the coding theorem for arbitrary 
3 with informed decoder. 

Theorem VI. 1 Let 3 £ C(H,IC) be an arbitrary com- 
pound channel and let irg be the maximally mixed state 
associated with a subspace Q <Z"H. Then 

Q ID {3) > inf I c {n g ,M). 

Proof. We consider two subspaces F^G® 1 of H.® 1 with 
Ti C G m C H m . Let h ■= dim Ti and we denote as 
before the associated maximally mixed states on Ti and 
Q by 7rjr ; and irg. 

If infjv e a I c {-kq,M) < there is nothing to prove. There- 
fore we suppose that 

inf I c (ttg,M) > 

holds. We will show that for each e £ 
(0, fifing 3 I c {^g, M)) the number 

inf I c (ng,M) - s 
is an achievable rate. 

Let r > with r < = which will be specified later. We 
consider the finite set of channels 3 T :— {Mi, . . . ,Mn t } 
associated to 3 given in (|45|) with the properties listed in 
lemma IV. 21 We can conclude from the proof of theorem 
IIV.3l that for each I £ N there is a subspace Ti C Q® 1 of 
dimension 

ki= [2' (min ^( 1 Ar T }Jc(7r ,M)-f)j (50) 
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and for each A/i G 3 T a recovery operation IZi such that 

F e (-Kr v KioMf) > l-N T (2- l( - cS2 -' 1 ^ +2 - l ( c ' 52 - h '^ 

+2N T y/2- l ^-iW-M s »), ( 51 ) 

where we have chosen r > and 6 > small enough to 
ensure that 

min / C (7T0,M) - | > 0, 
ie{i,...,JV T } 2 



and 



7(5) - 3</j((5) > 0. 



By our construction of 3 T we can find to each TV € 3 at 
least one TV G 3 r with 



|F e (7r^ , Hi o TV®') - F e (7r^ , ^ o TV®')I < I ■ 



(52) 



according to lemma l*VT2l Note that by the last claim of 
lemma IV. 21 we also have the following estimate on the 
dimension fc; of the subspace T\\ 



ki > {2 l( - inlMe3 / <=(' r S:- / ^)-f-T-2r log |j ^ 



(53) 



The relations ([52j) , fl51]), and (J53|) lead to the following 
conclusion: For each leNwe can find a subspace Ti C 
g®l c n ®i of dimension 



> L2 



2(infjV e3 / c (7r s ^)-f-r-2rlog f 



(54) 



such that for each TV £ 3 there is a recovery operation 
7£y\/ with 



F e (ir n ,K A r°N'® 1 ) > 1 - TV r (2/V r ^2- l ^-i(^v(S)) 

_l_2^'( c <5 2 — ^(0) _|_ 2-'( c ' i52 -' i '(0)) 

-Zr. (55) 

Finally if we choose our approximation parameter r in 
dependence of I S N we can ensure that the entangle- 
ment fidelity of our code approaches one: Taking e.g. 
any sequence (t;);£n with r; \ as I tends to oo, and 
lim/^oo I ■ ti = we see immediately that the RHS of 
([55]) tends to one, and that the RHS of (|54|) can be lower- 
bounded by 

|^2 ; ( inf -ve3 -TcOs:A0-£)j 



for all sufficiently large I € N. 



□ 



The following lemma is the compound analog of a result 
discovered by Bennett, Shor, Smolin, and Thapliyal in 
[27l ] (BSST lemma for short). We will use the following 
abbreviations: For any 3 C C(7Y,/C) and / € N we set 

3® 1 := {TV 8 ' : TV e 3} . 

Recall also our earlier shortcut notation 



J c (p,a) = MI c (p,AT) 



for pe 5(H). 



Lemma VI.2 (Compound BSST Lemma) Let 3 C 

C(7i,/C) 6e an arbitrary set of channels. For any p £ 
S(TC) letqs : i £ B(ri® 1 ) be the frequency-typical projection 
of p and set 



^5,1 '■- 



tr{qs,i) 



Then there is a positive sequence (Si)i^ satisfying 
limj^oo Si = with 

lim ~ inf I c {-K 5uU tf® 1 ) = inf I c (p,TV). 

Proof. The proof is via reduction to Holevo's proof [28[ 
of the BSST lemma for single channel supplemented by a 
discretization argument. As in the proof of theorem lVI.il 
we choose a decreasing sequence (t;); s n, with 77 > 0, 
lim/^oo It 1 — 0, and consider the finite set of channels 
3 Tl = {TVi, . . . ,TVjv } defined in (jlij)) associated to 3. 
By our construction of the set 3 n we know that 



M(p) > ^1, 

holds for all i € { 1 , . . . , N n } , which implies 



log TV (p) > log 



n 

d'2 



(56) 



(57) 



uniformly in i £ {1, . . . , TV T| }. On the other hand let 
J Tli e = {£1, ■ ■ ■ £n Ti } C C(H, H e ) denote the complemen- 
tary set of channels associated to 3 Tl . We alter 3 n je by 
mixing a part of useless channel U e £ C(Ti 1 Ti e ) to each 



S 3 T! ,e, i-e. we set 



Note that then for each £' £ 3' 



3(p) > 



Tl 



2 dim(H e 



and consequently 

log S'^p) > log 



2dim(H e ) 



(58) 



Applying Holevo's argument from [23] to each channel 
from 3 n and 3' e with our notation from lemma IIV.1I 



and uniform bounds (|57|) . (|58|) we obtain for each i e 
{l,...,TV Ti } 



~S(T^,0)-S(75(p)) 



< -j\og m (5) + 2<p(5) 
= : @i(6,D) (59) 



where (T U D) £ {(Afi,d'), (£-,dimH e )}- 
Since for each ie {1, . . . , N Tl } 

m-£i\\o<n, 
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we obtain 



\\£f l -£f%<ln. 



Hence choosing I sufficiently large we can ensure that 
In < and an application of Fannes inequality shows 
that 

\S(8M) - S(£i(p))\ < Tl log^^, (60) 



□ 

It is easy now to prove the direct part of the coding the- 
orem. 

First observe that for each k G N 



Q ID (3® k ) = kQ ID (3) 



(67) 



V 



and 



< Iri log 



dim Tie 

In 



Inequalities (j5"9"j) . (frJtJ)) . and (|6"Tj) show that 
1 



(61) 



< e ( (5,d') + e ; ((5,dim7i e ) 
dim Ti e 



holds. For any fixed p G S(?i® m ) let cftj G B(H® ml ) 
be the frequency-typical projection of p and set %s,l = 
. I s ' 1 \ . Then according to theorem I VI. II for each S G 

(0, 1/2) we have 

Q ID (3® ml ) >I c (ns,i,3® ml ). (68) 
From (H7J), dHHl), and lemma [VL2] we obtain 



Q/ D (3) = - lim yQ ZD (3® roi ) 

TO i-^oo I 



+Tl log ■ 



+^TJ log 



n 

dim7Y e 

In 



> I lim I inf j (7r (j^wm)^) 
to i^oo Z AA=3 

= -/ c ( P) a® m ). 



(69) 



A; (5 d' dimH ) (62) Obviously, (|69|) implies the following theorem: 



for each i G {1, ... ,N n }. It is then easily seen utilizing 
||B2J| that 



y/ c (7T 5>i , J®')--fc(P. ?n) 



< Aj(5,d',dimW e ). (63) 



Theorem VI. 3 (Direct Part: Informed Decoder) 

Let 3 C C(7i,/C). TTien i/ie capacity of the compound 
quantum channel built up from 3 in the scenario with 
informed decoder satisfies 



1 



Applying ([351 ) to p, 3 and tt^,;, 3®' we obtain 



Qid(3) > lim — max inf I c (p. 

(^oo Z P es(H® 1 ) Asa 



|7 c (p, J Ti ) - J c (p, 3)| < ti + 2n log ■ 



n 



(64) 



and 



<Ti+2l n \og—. (65) 



Using triangle inequality, (f6"3"|) , , and (|65|) we see that 



< Ai(S,d',dkaH e ) 
d 



-n + 21ti log 



In 



+n + 2 n log-. (66) 
n 

We are done now, since for any positive sequence 
{Si)ieN with lirm-foo 5 t = 0, )imi- too rn(8i) = 1, and 
limj^oo 5i log 77 = we can conclude from (f6"6) that 

lim y inf I c {k 5uU N® 1 ) = inf I c (p,A0- 

holds. 



Since the converse to the coding theorem IVI.3I follows 
easily from the converse part to the coding theorem for 
single memoryless channel @ we can state our capacity 
result as follows: 

Theorem VI. 4 (Capacity with Informed Decoder) 

The capacity of the compound channel built up from a 
set 3 C C(7i.JC) with informed decoder is given by 

Qid{3) = lim \ max inf IJp,J\f® 1 ). 
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